[MultiThreshold] Generalize data layouts for node execution #143

iksnagreb · 2024-09-13T13:44:33Z

The relevant aspect of the data layout annotation seems to be which axis is labeled as the channel dimension "C". The total number and ordering of the other axes do not matter, as long as we can find the index of the "C" axis, swap it to index 1 for node execution, and swap it back afterwards.

If there is no layout annotation, node execution falls back to the default assumption that "C" is at index 1, which is equivalent to the "NCHW" or "NC" layouts.
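The swap described above can be sketched roughly as follows. This is a minimal illustration using NumPy, not the code from this PR: the helper names `channel_axis`, `run_channels_first` and `count_thresholds` are hypothetical, and `count_thresholds` is only a simplified stand-in for the actual thresholding done during node execution.

```python
import numpy as np


def channel_axis(data_layout, default=1):
    """Index of the "C" axis in a layout string such as "NHWC".

    Hypothetical helper: falls back to `default` if no layout is annotated.
    """
    if not data_layout:
        return default
    return data_layout.index("C")


def count_thresholds(x_cf, thresholds):
    """Stand-in for the thresholding step (assumption, not the real kernel).

    x_cf has channels at index 1; thresholds has shape (C, T). For every
    element, counts how many of its channel's thresholds it reaches.
    """
    shape = (1, thresholds.shape[0]) + (1,) * (x_cf.ndim - 2)
    out = np.zeros(x_cf.shape)
    for t in range(thresholds.shape[1]):
        out += x_cf >= thresholds[:, t].reshape(shape)
    return out


def run_channels_first(x, data_layout, thresholds):
    """Swap "C" to index 1, execute, then swap it back."""
    c = channel_axis(data_layout)
    x_cf = np.swapaxes(x, c, 1)  # ordering of the other axes is irrelevant
    y_cf = count_thresholds(x_cf, thresholds)
    return np.swapaxes(y_cf, 1, c)


# 3-dimensional, channels-last input: x[0, w, c] = 3 * w + c
x = np.arange(6, dtype=float).reshape(1, 2, 3)
thresholds = np.array([[0.5], [10.0], [3.5]])  # one threshold per channel
y = run_channels_first(x, "NWC", thresholds)

# No layout annotation: "C" is assumed to be at index 1 ("NC")
y_default = run_channels_first(np.array([[0.0, 5.0, 5.0]]), None, thresholds)
```

Because only the position of "C" is looked up, the same code path handles "NC", "NWC", "NHWC" and "NCHW" without a table of allowed layouts.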

This is a rather experimental change which might break existing code and is currently still restricted to the well-known 2-, 3- and 4-dimensional layouts.

This PR is based on #92, which is less experimental and should be merged first (#92 also does not risk breaking existing code, as it only adds new special cases).

This allows node execution of MultiThreshold operators with an arbitrary number of dimensions, as long as the channel dimension is last. This is necessary to run some verification steps of attention operators, which, at least for some intermediate steps, have 3-dimensional data layouts. This does not change the behavior of execution on the already existing 2D and 4D data layouts.

maltanar · 2024-12-17T13:06:17Z

Instead of relying on data layout strings, how about we switch to an axis attribute (like many standard ONNX ops have) to indicate the location (index) of the channels axis? I'd actually prefer to deprecate the old data_layout attribute, and I think we can still keep backwards compatibility by treating data_layout=NCHW as axis=1 and data_layout=NHWC as axis=-1. If the interpretations of the two attributes disagree, axis takes precedence.
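One way to read this proposal is sketched below. It is only an illustration: the function name `resolve_channel_axis` and the choice to raise an error for layouts other than the two mentioned are assumptions, not part of QONNX or of this PR.

```python
# Legacy layout strings mapped to channel axes, as suggested in the comment.
LEGACY_LAYOUT_TO_AXIS = {"NCHW": 1, "NHWC": -1}


def resolve_channel_axis(axis=None, data_layout=None):
    """Hypothetical resolution of the channel axis from both attributes.

    An explicit `axis` takes precedence over the deprecated `data_layout`.
    With neither attribute set, the channel axis defaults to index 1.
    """
    if axis is not None:
        return axis
    if data_layout is None:
        return 1
    if data_layout not in LEGACY_LAYOUT_TO_AXIS:
        # Assumption: the proposal only names NCHW and NHWC, so anything
        # else is rejected here instead of being guessed.
        raise ValueError(f"unsupported data_layout: {data_layout!r}")
    return LEGACY_LAYOUT_TO_AXIS[data_layout]
```

With this reading, existing models that only carry data_layout keep their current behavior, while new models can set axis directly.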

iksnagreb added 4 commits December 13, 2023 17:13

Merge branch 'main' into fix/multi_threshold_exec_layouts

68c7cc5

[Test] Add more allowed data layouts for MultiThreshold

4941c41

iksnagreb mentioned this pull request Sep 13, 2024

Make quantized activation handlers data layout aware Xilinx/finn#1183

Open

iksnagreb added 3 commits October 10, 2024 16:36

[MultiThreshold] Remove set of allowed data layouts

817a23e

As we only really care for the index of the "C" axis there is no need to restrict the set of valid layouts here.

[Test] Remove "allowed values" check for data layouts CustomOp

9d73e16

[MultiThreshold] Replace default data_layout by fallback in execute_node

c0b4534

Note: Only covers data layouts for tensors with less than five axes

This was referenced Jan 20, 2025

Make quantized activation handlers data layout aware eki-project/finn-plus#8

Merged

[Deps] Update qonnx to track https://github.com/iksnagreb/qonnx eki-project/finn-plus#9

Merged

iksnagreb added a commit to iksnagreb/finn that referenced this pull request Mar 3, 2025

[Thresholding] Generalize data layouts for node execution

d86ec13

See fastmachinelearning/qonnx#143 for the similar generalization applied to QONNX MultiThreshold

This was referenced Mar 3, 2025

[Thresholding] Generalize data layouts for node execution Xilinx/finn#1289

Open

[Thresholding] Generalize data layouts for node execution eki-project/finn-plus#50

Open
